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The internal dynamics of macro-molecular systems is characterized by widely separated time scales, ranging 
from fraction of ps to ns. In ordinary molecular dynamics simulations, the elementary time step At used to 
integrate the equation of motion needs to be chosen much smaller of the shortest time scale, in order not to 
cut-off important physical effects. We show that, in systems obeying the over-damped Langevin Eq., the fast 
molecular dynamics which occurs at time scales smaller than At can be analytically integrated out and gives raise 
to a time-dependent correction to the diffusion coefficient, which we rigorously compute. The resulting effective 
Langevin equation describes by construction the same long-time dynamics, but has a lower time resolution 
power, hence it can be integrated using larger time steps At. We illustrate and validate this method by studying 
the diffusion of a point-particle in a one-dimensional toy-model and the denaturation of a protein. 



I. INTRODUCTION 



Molecular Dynamics (MD) simulations are playing an increasingly important role in contemporary biophysics, biochemistry 
and molecular biology, as they allow for an atomistic level of description of many fundamental molecular processes. Unfortu- 
nately, when the system is very large, or when the reaction under investigation is very slow, the computational cost of the MD 
simulations can be extremely large. Hence, a large effort is being invested by several groups, in order to develop alternative 
theoretical approaches or improved numerical algorithms. Examples include reaction path sampling methods (UISI, Markov 
state models |6-8|, projection techniques |9|, adaptive time-step MD 1 10] and temperature accelerated MD | 111, to name a few. 

The inefficiency of MD simulations to investigate the long-time dynamics is related to the co-existence of widely separated 
time scales in molecular systems. Eor example, while the time scale for equilibrium oscillations of covalent bonds is of the 
order of the fraction of ps, the time scale associated to the rotation of dihedral angles in a poly-peptide chain is of the order of 
ns. Clearly, in order to perform realistic MD simulations, one has to use integration time steps which are much shorter than the 
shortest time scale, hence typically in the fs range. 

In a recent work |12] it was shown that such a separation of time scales can be exploited in order to rigorously derive 
an effective stochastic theory (EST) which generates by construction the same long-time dynamics of the ordinary Langevin 
equation (LE), at a lower time resolution power. The basic idea of the EST consists in using the path integral formalism and 
Renormalization Group (RG) techniques, in order to systematically and analytically perform the integral over the fast Fourier 
components of the Langevin trajectories. The advantage of this procedure is that the low time resolution EST can be simulated 
using larger discretization time steps. It is important to emphasize that the EST is formulated in terms of the same degrees of 
freedom of the original theory (e.g. the atomic coordinates), hence not rely on any choice of reaction coordinate. 

In Il l2iil the EST was formulated in terms of a stochastic path integral and tested on a simple one-dimensional toy model, using 
a Monte Carlo algorithm. The main result of the present work is to formulate such an effective theory in terms of an effective 
Langevin equation, which contains a time-dependent diffusion constant. Such an equation can be straightforwardly integrated 
using a standard Ito rule, and adopting large time steps A^. 

In order to illustrate and validate the present approach, we apply it to simulate the stochastic dynamics of two test- systems: 
a simple one-dimensional toy model and a small protein fragment. In both cases we find that the effective Langevin equation 
yields the correct long-time evolution of the system, even when one adopts an integration time step 20 times larger than that used 
to integrate the original LE. 

The paper is organized as follows. In section[II]we review the path integral formulation of the ordinary over-damped Langevin 
dynamics. In section III we summarize the construction of the path integral which defines the EST. In section IV we show 
how such an effective theory can be equivalently formulated in terms of an effective Langevin equation, in which the physical 
effects associated to the fast dynamics is replaced by an effective diffusion coefficient. In section [V] we present the illustrative 



applications of the present approach. Our conclusions are summarized in section VI 
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II. PATH INTEGRAL FORMULATION OF THE OVER-DAMPED LANGEVIN DYNAMICS 



Let us begin by reviewing the path integral formulation of the stochastic dynamics generated by the ordinary over-damped 
Langevin Eq. 

x = -^vu{x)+^{t). (1) 

Here X = (xi, . . .jX^p) is a point in the configuration space of A^^ particles, Dq is the diffusion constant and r{{t) is a stochastic 
force with zero average, obeying fluctuation-dissipation relationship 

(il?(Oil'(0)) = 2£)o5(f)5o-8''', {i,j=l,...,Np a,b=l,2,3). (2) 

The stochastic differential Eq. ([T]) needs to be complemented by a prescription which provides a definition of the time 
derivative of the stochastic variable X. A choice which is commonly adopted in numerical simulations is the so-called Ito 
prescription, in which Eq. ([T]) is defined as the limit of the discretized equation 

X/+1 - X/ = -^^VU{Xi) + V^DoAt R/, (3) 
Kb I 

where X/ represents the configuration at the /— th time step and R/ is a Gaussian noise vector with zero average and variance 
given by 

{i^,R'j,)=dijd,rd^'. (4) 

We note that the Eq. ^ defines a Markovian process, i.e. the probability distribution for the configuration at time step / + 1 
is completely determined by the configuration at time step /. The term ^/2DoAt Rf is the random (Brownian) displacement in 
configuration space, after an elementary time interval A^. 

Let us now compute the probability a given path, i.e. of a specific sequence of Nt conformations X(x) = (Xi,X2, . . . ,Xa/^), 
generated by iterating Nt times Eq. ([3]). To this end, we follow closely the discussion in |4 | and first compute the (normalized) 
probability density of generating a given string of random numbers (Ri , R2, . . . , Ra/^ ) 



3^ 



YldRi (5) 

i=i 



Next, we use Eq. ^ to relate the probability density of such a sequence of random numbers to the probability density of the 
sequence of configurations (Xi,X2, . . . ,Xa/^). After substituting ^ into the exponents and computing the Jacobian of such a 
transformation, one arrives to: 



fP(Xi, . . . ,X^J = const. X e '^0^ Yl'^Xt. (6) 



The (un-normalized) conditional probability of visiting the configuration X^^ after A^^ steps, starting from the configuration 
Xi reads: 

^ /Nt \ -i:fl7'(^/+i-^/+^o 
P{X,\XNp',NtAt) = J \J[dXi^ e 4^oA^ (7) 



In the continuum limit, 

A^. ^ 00, (8) 
NtAt = t (fixed), (9) 

the expansion of the square in the exponent of Eq. ^ contains the definition of the integral in the so-called Ito stochastic 
Calculus: 

(/) VU{X) • dX = lim (X/+1 - X/) • VU{Xi) . (10) 
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Due to the stochastic nature of the variable X, the fundamental theorem of the Ito Calculus differs from the one of the Riemann 
Calculus, and reads — see e.g. the discussion in L14J — 

(/) VU{X)-dX = U{XN)-U{X,)-Do f dxV^U{X{x)), (11) 
Jxi Jo 

where the integral appearing on the right-hand- side is an ordinary Riemann integral. Notice that, in the limit Dq ^ limit, one 
recovers the conventional fundamental theorem of Calculus. 

Using this theorem, in the continuum limit the conditional probability P{XN\Xi;t = NAt) can be expressed as a path integral 

/'(X^|Xi;r) = e-^(^(^-)-^('^'» r"2)Xe-^^//[^l, (12) 

JXi 

where 



is called the effective action and 



%/[X(0] =j/^{^^+ (13) 
^.//(X) = -^^^ ((VC/(X))2 - 2kBTV^U{X)) (14) 

is called the effective potential. 

Incidentally, we note that the same expression ([12]) can be obtained directly from the Fokker-Planck Eq., without having 
to define a stochastic Calculus — see e.g. the discussion in |T| — . In general, it can been shown that the probability density 
generated by the Langevin Eq. with a constant diffusion coefficient Do is independent on the convention adopted to define the 
stochastic Calculus. As we shall see in section |IVj this is not the case for stochastic differential equations, with a multiplicative 
noise. 



III. THE EFFECTIVE PATH INTEGRAL FOR THE LONG-TIME STOCHASTIC DYNAMICS 



In this section, we sketch the derivation of the EST, which formulates the stochastic dynamics described by the path integral 
( p^ at a lower time resolution power. For all further details we refer the reader to the original paper 1 12 |. 

For simplicity and without loss of generality, it is convenient to consider the path integral with periodic boundary conditions 

Z(0 = j dXP{X\Xu) = ^ "DX e-^^ff^^l (15) 

Let us introduce the Fourier components of the paths, 

X(co^) = - f di:X{x) e-'"^-' (16) 
t Jo 

X(x) = X(x + 0=L^(^-)^'''"'- (17) 



where co„ are the Fourier frequencies: 



2n 

(i)n=Y^^ ^ = 0,±1,±2,.... (18) 



The path integral ( 15 ) is defined in the continuum limit. In practice, numerical simulations are always performed using a finite 
discretization time step At. Clearly, the shortest time intervals which can be explored in a numerical simulation is of the order of 
few At. Equivalently, the largest frequencies of the Fourier transform of the stochastic paths X(ol)) are of the order few fractions 
of an ultra-violet (UV) cut-off 

Q. = 2'K/At. 

In order to exploit the decoupling of the internal time scales in molecular systems, it is convenient to split the Fourier modes 



of the paths contributing to ( 12) in high-frequency — or "fast" — modes and low-frequency — or "slow" — modes. In this case. 



a real number < Z? < 1 can be defined such that the frequency range (0,^1) is split in two intervals (0,/? Q) U (b ^1,^1). 
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Correspondingly, one can define the "fast" component of the path X>(x) and the "slow" component of the path X<(x), by 
summing over the Fourier modes in the (0, b Q) and (b range, respectively: 

X<(0 = £ X(o)„)e'"''' (19) 

\tS)n\<bSl 

X>(0 = ^ X(co„)^''""^ (20) 



The complete path integral ( 15 ) can therefore be written in the following way: 



z{t) = ^^DX<^^DX> ^"^^//[^<+^> 



/ (DX< ^-V/[x<] e-^>^^<\ (21) 



where 

is called the renormalized part of the effective action. 

The EST is constructed by explicitly evaluating 5'>[X<], i.e. by performing the path integral over fast modes X>(x). In the 
limit in which the fast and slow modes are separated by a large gap in the spectrum of Fourier modes — i.e. if the system displays 
a decoupling of time scales — such an integral can be carried out analytically in a perturbative approach based on Feynmann 
diagram techniquesi 12 |. The expansion parameter such a perturbation theory is the ratio between the typical frequency CO of the 
slow modes and the UV cut-off bQ.. Clearly, if hard and slow modes are decoupled, the ratio co/ {bQ.) is a small number, hence 
the terms proportional to higher and higher powers L of such a ratio provide smaller and smaller corrections. 

If one accounts only for corrections up to order L = 3, the renormalized part of the action takes the form of an effective 
interaction term 1121 . i.e. 



^-^>[x<(T)] ^ ^-f^dT vfffix^i^)] ^23) 

where 



lfDo{l-b)\\. Dl{l-b') 



nba J ^ 3n(bQ) 

Note that the first line is the leading order term (i.e. L = 1) , while the second and third lines display the order L = 2 and L = 3 
corrections, respectively. 

We emphasize that the result of the EST construction is a new expression for the same path integral ( p3] ), in which the UV 
cutoff been lowered from Q.io bO.. Equivalently, the path integral is discretized according to a larger elementary time step, 
bdjb: 

J At JAt/b 

In these expressions, the symbol denotes the fact that the path integral is discretized according to an elementary time step 
Ar and we have suppressed the subscript "<", in the paths. It can be shown that the proportionality factor between Z^{t) and 

Z^lj{t) depends only on t and does not contribute to the statistical averages. 

We conclude this section be emphasizing that the EST is expected to accurately describe only the long-time dynamics, while its 
accuracy brakes down at sufficiently short- times. This type of failure represents a common feature to all statistical or quantum 
effective field theories. For example, the multipole expansion for the electric field generated by a finite charge distribution 
becomes highly inaccurate at short distances, of the order of the charge distribution size. 
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IV. THE EFFECTIVE LANGEVIN EQUATION 



The EST presented in the previous section provides a low time resolution path integral representation of the stochastic dynam- 
ics generated by the LE ([T]), in which the fast dynamics has been integrated out and replaced by a new effective interaction term, 
y^y^y^(X) involving only the slow components of the paths. The goal of this section is to prove that such a low time resolution dy- 
namics can be equivalently formulated in terms of an effective Langevin Eq., in which the effects of the fast modes is implicitly 
taken into account through a systematically calculable correction to the diffusion coefficient. To this goal, we first formulate an 
ansatz for such an effective Langevin Eq., and then determine a condition which follows from requiring that our equation should 



generate the path integral (25 ), which defines the EST. 

Let us therefore discuss the stochastic dynamics generated by the Eq. 

X = Vf/(X,) + (1 - a)VD{Xi) + v/2D(X) 



(26) 



where a is a insofar undefined constant whose origin will be discussed shortly, r\{t) is the usual delta-correlated Gaussian noise 
and D{X) is a position-dependent diffusion coefficient in the form 



D{X)=Do^d{X). 



(27) 



In the following, the function d{X) and its Laplacian will be assumed to be small, compared to the corresponding hard scales: 



J(X) 
Do 



«1, 



V2j(X) 



<L 



(28) 



We shall now determine a condition on the function d{X) which follows from requiring that arbitrary time-dependent averages 
of configuration-dependent observables generated by Eq. ([26]) coincide with those calculated in the EST, i.e. using the renor- 
malized path integral Eq. (25 ). Note that, in the right hand side of Eq. (26 ) we have introduced the term (1 — a)VZ)(X), which 
does not appear in the ordinary LE ([T]). In order to clarify the motivation for such a term, we need to make a short digression on 
the so called "Ito-Stratonovich dilemma", which affects stochastic differential Eq.s with multiplicative noise. Following closely 
the discussion in |[T3l , let us consider a generic stochastic differential Eq. in the form 

X = f(X)+g(X)Ti(0. (29) 
In such a family of Eq.s, an ambiguity arises from the fact that the value of the integral which appears in its formal solution, 

rt+At 



dsg[x{s)]^{s), 



(30) 



depends on wether the function g(X(s)) is evaluated before or after the action of the random force T\{t). The most general 
definition of the integral ([30jl can be cast in the form 



i-t+At 

-a)X(0] 



ds^{s), 



(31) 



where the real number < a < 1 specifies the prescription used to define the stochastic Calculus. For example, a = leads to 
an Ito Calculus, while cx = 5 corresponds to a Stratonovich Calculus. 

For a generic Eq. in the form ^29\, the ambiguity ( 3 1 ( is not resolved in the continuum limit. That is to say that different 



choices of a lead in general to different Fokker-Planck Eq.s. However, in the specific case of the Eq. (26), the ambiguity is 
resolved by the addition of the term (1 — ot) V£)(x), which assures that, for any choice of prescription a, the resulting probability 
density obeys the same Fokker-Planck Eq. lfT3l . 



dt 



DiX) (^Vt/(X)- 



PiX,t) 



(32) 



In addition, it can be shown that the solution of such a Fokker-Planck Eq. converges to the correct Boltzmann's distribution, in 
the long-time limit lTSl : 



P{X,t)^^ const. X exp 



U(X)_ 



(33) 
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While the Fokker-Planck Eq. associated to Eq. ( [26| ) is independent on the choice of a, the path integral representation of the 
conditional probability P(X/,^ |X;, 0) depends such a parameter and reads 



P(X„.|X,0) =/2>Xexp[-^'.x^(x- 

^t D{X) 



.^V(/(X)-(l-2a) VD{X)\ 



+aV- 



where (Z)X is a position-dependent measure: 



Nt 



V[/(X)+VZ)(X) 



1 



(34) 



(35) 



In the following, we shall adopt the Ito convention a = 0, in which the stochastic differential Eq. (26 ) is defined by the rule: 

(36) 



X,-+i = X,- - ^^-^^ yU{Xi)^^t VD{Xi) + V2Z)(X) At Xi. 
Kb I 



The periodic path integral generated by such an Eq. is 

dNi 

Nt 

n 

i=i 



^ELEit) 



n / ^o^i 

2D{Xi) 



1 

AnMD{Xi) 



. exp 



Y 4£)(X,)Af 



4(^Br)2 



VZ)(X,-)- 



1 ,2, A? 



4Z)(X,) 



(V£)(X,-))'Af- 



VC/(XO-VZ)(X,-) 



(37) 



We now show that the terms appearing in the second line provide sub-leading contributions in the expansion scheme (28). To 
this end, we first observe that the Langevin Eq. (|26jl implies that, on average, 



At 
2k^ 



V[/(X,-)-VZ)(XO: 



^ (X,+i - X,-), •V£)(Xi) - ^^^VD{Xi)\ 



2D{Xi) 



Substituting this into Eq. p7] l we find 

dNp 



2£>(X,) 



-I%^+%^vt/(x,) 



(38) 



4D{Xi)At 



2kBT 



(Vf/(X,-))'Ar- 



4£>(X,) 



(VZ)(X,-))2a/ 



Taking the continuum limit, and recalling the fundamental theorem of the Ito Calculus — Eq. ^ — we obtain 

X2 , D{X) ,„„,^,,2 i3(X)^,...„. 1 



ZELE{t) 



DX exp 
2)X exp 



/' 

Jo 

■/' 

Jo 



dx 
dx 



4£)(X) A{kBTf 
X? D{X) 



4D{X) A{kBTf 

which, to leading order in d(X), reads 

ZELE{t) = j>i)X exp - j'^ dx 



(Vt/(X))2 - ^^V2t/(X) - L>(X) 



2kBT 



4Z)(X) 
1 
4 



£'0 



X2 £>(X) 
4D(X) + A{kBTY 



{VU{X)f - ^V^U{X) + ^V2J(X) 



2^:8^ 



4Z)n 



(39) 



(40) 



(41) 



(42) 



The last term in the exponent is the contribution coming from the second line of Eq. ( [37] ) and is of higher order in the expansion 
scheme ([28]) and therefore can be neglected. 
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Finally, using Einstein's relationship, it is immediate to show that to the same order in our expansion scheme, the correction 



to the freely diffusive term 



4(Z)o+J(X)) 



cancels out with the corresponding correction to the measure 



Nt 



1 



47iAt {Do ^d{Xi)) 



(43) 



Hence, we arrive to the compact expression for the periodic path integral associated to the effective Langevin Eq. defined by Eq. 



ZELE{t) ^ ^^Z)Xexp -f^ dx ^+y,//(x)+j(x)y,//(x) 



(44) 



We now impose the condition that the effective Langevin Eq. ([36]) should generate the same stochastic dynamics of the EST, 
i.e. the same periodic path integral: 



^ELEit) = Zest it), Vr. 



This condition can be re- written as 

(exp 



dxVeff{X) diX) 



) = ( exp 



/' 

Jo 



Do / dxVfMX) 



(45) 



(46) 



where the notation (•) denotes the average performed over the ensemble of periodic paths generated by the ordinary LE ([T](. 
Since such an Eq. must hold for any total time interval t, then 



(exp[-y,^^[X(x)] d[xm ^ (exp [-Do n^//[X(x)]] ), Vx G [0,^]. 



(47) 



Now we recall that the renormalized part of the potential y^y^y^(X) and the renormalized part of the diffusion coefficient d{X) 
provide small corrections to the stochastic dynamics generated by the ordinary LE. Hence, we can expand the exponentials to 
leading order and obtain: 



{Veff{X{x)) d{xm^Do (y5/(X(x))), Vx e [0,t]. 



(48) 



Eq. (48) represents a condition on the correction term to the diffusion constant d{X) which is sufficient to ensure that the 
stochastic differential Eq. ([36]) generates the low time resolution dynamics of the EST. The main assumptions made to derive it 
are the existence of a separation of time scales, and the related expansion scheme ([28). 

Now we make one further approximation, which consists in describing the correction to the diffusion coefficient d{X) in the 
mean-field approximation, i.e. 



which implies 



{Veff{X{x)) J(X(X))) (y.//(X(x))) (J(X(X))), 

(Kyx(x))) 



{d{z)))^Do 



{Veff{X{x)))' 



(49) 



(50) 



We stress the fact that the averages involved in Eq.([50]) is performed over all periodic paths generated by the ordinary LE. It 
depends on time, but does not depend on the position. We also note that Eq.(50) diverges if the average value of Veff{X{z)) 
vanishes at some time x. In principle, this problem may be cured by adopting some regularization prescription. However, in 
practice, for all systems we have considered, the average of the effective potential was always found to be a negative number, 
for all times x. This is because the stochastic trajectories are most likely to visit regions of configuration space where the force 
is small, and the effective potential is dominated by the Laplacian contribution — cfr. Eq. ([14 ) — . 

Within the mean-field approximation for the diffusion coefficient, the Ito Eq. ( [36] ) which defines the effective Langevin Eq. 
reduces to one with a non-multiplicative noise: 



Xj+i = X; - 



At{Do^{d{i))) 



VU{Xi) + ^/2{Do^{d{i)))At ^i. 



(51) 



From Eq. ( [5T] ) it is manifest that, in the mean-field approximation ( [50] ) and to the lowest order in the expansion scheme ( [28] ), 
the dynamics of the fast modes can be integrated out by means of a time-dependent rescaling of the time intervals: 



Do{ti^i-ti) {Do^{d{i))) {ti^i-ti), 



{ti+i-ti) 



1 



1 + (J(/)) 



At. 



(52) 



Based on such observation, we are finally in a condition to define a simple three- step algorithm, which yields the stochastic 
dynamics at low time resolution power: 
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X 



FIG. 1: Left panel: the double well-potential of the one-dimensional toy model. Right panel: the corresponding L=0 (i.e. bare) and L=l 
contributions to the effective potential (The L=l contribution is not on scale) 



One generates an ensemble of stochastic paths by integrating the LE, using a large time step Ati = At/b. Here, At is a 
small integration time step interval for which the LE is convergent. This means that the dynamics which occurs at time 
scales smaller than At is not expected to contribute to the process under investigation. 

Such paths are used to compute the average time-dependent correction to the diffusion coefficient {d{x)), according to 
Eq. ( [5Q| . The frequency cut-off in the effective theory, bO,, which enters in the definition of the renormalized effective 
potential V^j^j^{X) — cfr. Eq. (24) — is related to the large integration time step by 



2% 2% 

ba = b = (53) 

At AtL 



3. Each time intervals between each consecutive instants ti^i and ti are rescaled according to Eq. ([52]), i.e. by the dilation 
factor provided by the renormalized average diffusion coefficient. 

It is important to note that, in general, the average value of the configuration-dependent operators is expected to vary over time 
scales which are of the order of the thermal equilibrium relaxation time, i.e. typically several orders of magnitude larger than 
the time scales associated to the slowest local microscopic conformational changes. This means that, in numerical simulations, 
the value of the mean-field correction to the diffusion coefficient, {d{x)) evolves very slowly, hence it needs to be updated only 
after a large number of elementary integration time steps. In the following, we shall refer to this algorithm as to the effective 
Langevin Eq. (ELE) approach. 



V. TWO ILLUSTRATIVE APPLICATIONS 



In this section, we illustrate and test the ELE approach proposed in the previous section. We begin by discussing a simple 
toy model, consisting of a point-particle diffusing in an external one-dimensional potential, and then apply the same approach to 
investigate the unfolding of a small protein, at a high temperature. 



A. Diffusion in a one-dimensional double-well 



Let us consider the diffusion of a point-particle in the external double- well potential 

U{x)=a{l-x^f. (54) 

We have chosen a system of units in which a = 1, Z)o = 1 and P = 1/(^57) = 5. The potential U (x), the corresponding effective 
potential Veff{x) and the renormalized effective potential V^j^j^{x) evaluated to order L = 1 for this system are plotted in Figll 
The dynamics of such a system is characterized by a decoupling of time scales, since the quasi-free diffusion in the bottom o:' 
the wells is much slower than the crossing of the transition regions, where the force is large. 
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FIG. 2: Time evolution of the average position of a point particle diffusing in a one dimensional double- well potential. The upper panel 
represents the time evolution at long times, when the ELE is expected to be accurate. The lower panel displays the evolution at short times, 
where the ELE becomes inaccurate. The red line is the result of the integration of the ordinary LE with a large integration step A^^ = 0.04, the 
black line is the result of integrating the LE with a small time step = 0.002 and the green line is the result of solving ELE with the large 
integration time step A^l = 0.04. 



Our goal is to compare the predictions for the time evolution of the average position {x{t)) obtained in the effective theory 
defined by the ELE and in the original theory, defined by the ordinary LE. We have generated two sets of 90,000 trajectories, 



10 



with initial condition, x{0) = — 1. The two ensembles of trajectories were obtained by integrating the LE ([T]), using two different 
elementary time steps: a "small" one, At = 0.002, and a "large" one, Ati = 0.04. Notice that At is 20 times smaller than Ati. 
The results are reported in Fig. [2] where the time evolution of {x{t)) is shown at short times (lower panel) and long times (upper 
panel). Both panels show that the results of the straightforward integration of the LE using the large integration time step At are 
inconsistent with those obtained integrating the same equation, using the small integration time step Ati. Hence, the short- time 
dynamics which is cut-off by the large time step cannot be neglected. 

On the other hand, in the ELE, such a fast dynamics is not neglected, but it is effectively taken into account at the mean-field 
level, through the time-dependent correction to the diffusion constant {d{t)). Such a term was calculated from Eq.(50 ) using the 
Langevin trajectories obtained with the large time step, and is plotted in Fig. |3] We see that the integration of the fast modes 
leads to an effective slow-down of the dynamics. Indeed, in the ELE all the time steps become approximatively 5% longer. In 
the upper panel of Fig. [2] we can see that, after applying the rescaling transformation ([52]), one recovers an excellent agreement 
with the predictions obtained integrating directly the LE with the small time step. 

We emphasize again that all effective field theories (including our EST, or equivalently the ELE) are expected to accurately 
describe only the long-time (i.e. infra-red) dynamics. They are not expected to provide reliable descriptions of the time evolution 
of the system in the short-time (i.e. ultra-violet) regime. This feature is clearly evident in the lower panel of |2j where we show 
the evolution of {x{t)) at short times. We see that for ^ ^ 10 the results of the ELE calculation obtained with large integration 
time step Ati start to deviate from the results obtained from the LE, with the small integration time step A^. This is regime where 
our effective theory breaks down. On the other hand, for ^ ^ 15 the ELE curve approaches the LE results obtained with the small 
integration time step. In such a regime, the ELE provides an excellent description of the dynamics. 



B. High temperature protein denaturation. 

As a second test of the ELE approach, we study the unfolding of the 16-residue C- terminus of protein GBl shown in Fig|4] 
This system was used as a test system in our previous studies, in the dominant reaction pathways approach 1 15 |. We adopt a 
coarse-grained Go-type model 1 16], in which the explicit degrees of freedom are beads which represent the single amino-acids. 
The energy function of this model is assumed to be the sum of pair- wise interactions: 



U 



(X) = ^i:^(|r,+i-r,|-^)2 + i^4£ 



2t- 
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(55) 



(2 = 0.38 nm represents the average distance between two consecutive a-carbons and k = 1000 kJ mo/~^nm~^ is the elastic 
constant of the harmonic spring. The strength of the Lennard- Jones attraction is set by the parameter 8 = 4 kJ mol~\ while 
G = 03 nm represents an effective residue size. Gij is the matrix of native contacts, i.e. Gij is set to 1 if the distance between 
the residues / and j in the native conformation is less than 0.65 nm, and otherwise. 

In the left panel of Fig. |5]we show the average time evolution for 200 ps, of the fraction of native contacts of this chain, 
starting from a configurations close to the experimentally measured native configuration. The average was performed over 900 
independent trajectories, generated by integrating the LE with a time step dt = 0.002 ps, at a temperature T = 200K and diffusion 




Time 



FIG. 3: Calculated time evolution of the (d{x{t))) in the one-dimensional toy model define by Eq. |54| 
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FIG. 4: 16-residue C-teraiinus of protein G-Bl (PDB code 2gbl). 




FIG. 5: Left panel: Time evolution of the fraction of native contacts of the p-hairpin, at the temperature T = 200K. Right panel: Comparison of 
the results of the LE and ELE simulations for the time evolution of the fraction of native contacts, for the 16-residue poly-peptide chain shown 
in Fig.|4] 



constant Dq = 0.8nm^ps~^ . We see that, at such a low temperature, the experimentally measured native state remains stable. On 
the other hand, at high temperatures, the native structure is thermodynamically unstable and the protein spontaneously unfolds. 

We computed the time evolution of the average fraction of native contacts, during the high temperature unfolding reaction. 
The purpose of this section is to assess the validity of the ELE method, by comparing the results of the microscopic calculation 
obtained from the LE with a small integration time step At = 0.002 ps with those obtained from the ELE, using a 20-fold larger 
integration time step, Ati = 0.05 ps. We have observed that, for time steps smaller than 0.002 ps, the results of the Langevin 
simulations are independent on the choice of the integration time step, within the statistical errors. 

In analogy with the previous one-dimensional example, we have generated two sets of 900 independent trajectories at 
T = 300K and Dq = 1.2nm^ps~^ by integrating the LE with the small and large time steps, starting from the experimentally 
determined native state. Fig. |5] shows that a small yet statistically significant discrepancy is observed between the results of the 
LE with time steps At, and Ati. Indeed, the results of the LE obtained with the large integration time step fall consistently short 
of the corresponding points obtained with the small integration time step. As in the previous one-dimensional example, this is a 
clean signature of the fact that the dynamics which occurs at the time scale of 10~^ ps cannot be simply cut-off. However we can 
see that, once such a dynamics is effectively taken into account through the ELE, the agreement with the long-time LE results 
obtained using the small integration time step is recovered. We also note that our effective theory breaks down in the short-time 
regime, as expected. However, such a limitation of the ELE method does not represent a problem, in practical applications, since 
the short-time molecular dynamics can be very efficiently simulated using the existing algorithms. 



VI. CONCLUSIONS 



In this work we have introduced an effective theory based on a first-order stochastic differential equation which describes the 
microscopic molecular dynamics, at a low time resolution power. In such an approach, the effects of the fast dynamics which is 
excluded by using a large integration time step are implicitly accounted for by means of an effective time-dependent diffusion 
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constant, which is derived using the RG approach. The assumptions used in the derivation are: (i) the existence of a gap in the 
internal dynamical time scales of the system and (ii) the mean-field approximation in the calculation of the effective diffusion 
constant. 

We have illustrated and validated our method by studying the diffusive dynamics of a one-dimensional toy-model and of a 
simple coarse-grained model for a protein fragment. In both cases, we have found that our effective theory yields the correct 
long-time dynamics, even when one uses an integration time step which is 20 time larger than the one used in the ordinary 
Langevin simulations. 

The present preliminary study did not aim to systematically assess the accuracy of the ELE approach for realistic molecular 
models, nor to accurately estimate the computational gain which can be achieved by simulating the effective theory, rather than 
the full theory. To this purpose, one should perform a systematic analysis based on realistic atomistic models, which include also 
three-body and four-body potential to account for bonded interactions, along with non-bonded electrostatic forces and solvent 
induced interactions. The first preliminary results reported here serve as a motivation for such an analysis. 
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